CDS

Accession Number TCMCG075C10668
gbkey CDS
Protein Id XP_007039470.2
Location join(31248838..31248887,31248992..31249055,31249124..31249169,31249310..31249443,31249532..31249605,31249742..31249814,31250307..31250383,31250587..31250677,31251034..31251160,31251251..31251355,31251810..31251939,31252051..31252102,31252191..31252235,31253136..31253234,31253344..31253496)
Gene LOC18606020
GeneID 18606020
Organism Theobroma cacao

Protein

Length 439 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007039408.2
Definition PREDICTED: uncharacterized protein At4g17910 isoform X2 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category F
Description protein At4g17910 isoform X1
KEGG_TC -
KEGG_Module M00065
KEGG_Reaction R05918
KEGG_rclass RC00004
BRITE ko00000, ko00001, ko00002, ko01000
KEGG_ko ko:K05283
EC -
KEGG_Pathway ko00563, ko01100, map00563, map01100
GOs -

Sequence

CDS:  
ATGGATTCACTACCAAGATCTTTCAATGCCAACAAGCACCTCAAAGAACAATTCGTCAGCAATTTGACGGGATCGTCGATGCTAGAAATCTCCGCGCTTTTGACCACTGTCCCTATTCTAGTACTTTTGCGGCCGTCCATCTGTTTTCAAGCCCTAACTGATGGTGATACCAAAGAGACCTCTTTAAAGAAAAATGATACTGCAATCGTTGCTTTTAAGAATTTAAAGGCTTACCTAGCCACATTAGTTATGGATTCTGTCTTCATTGTTCTTCCCACACTTTTACTTTTCACTGTTCTAGCTGAGTGGATATATGTATGGATGATTTTGTTATCGTTGTTGCTGATTTTCGTTATTGCAGGCAAAAGATCTCCTCATTCGCCTTACTTGGAAGGACCTAAATCTTTTAGGATGAGTATATCATCGTATAGGGTTGCTATGATGGACCTTGGAGTTGGCTCCTTTGTACTAGTGAATTCAATTGTTTCACGGCAAGCGCGAAATGTCTCATCATCAATGGATTGGTGGAAGGCAGCCCTTAAATCTACGAGTCCACTACTACTGCTAGGATTTGCTAGACTTGTTTCTACAATGAGTGTGGACTATCAGGTACATGTGGGGGAATATGGAGTACACTGGAATTTCTTTTTCACACTTGCTGGTGTATCTATCCTTACATCCACAGTAAATGTTCCCTCAAAATATTCTGGAATTCTTGGTTCAGTAATTTTAGTTGGGTACCAAAGTTGGTTGAGCAGTGGGTTAAATGTGTATCTTCTTTCTAACAAAAGGGGAATGGATATCATAAGCAGAAACAAGGAGGGAATTTTTAGCATATTTGGATACTGGGGTATGTATCTTATTGGTGTTCAGGTGGGCTACTACCTCTTCTTTGGAAATCATTCCTCTGTCATGCTGAGAAGCAACAATGGAACACGAATTAGAGTCTGGCTTCTTTCTATTCTGTTTTGGATTCTAACTGTGCTTCTAGACAGGCATGTTGAAAGAATTTCACGTAGAATGTGCAACCTGCCTTATGTTACTTGGGTGCTGGCTCAAAATCTGCAGCTTTTAGCAATACTAATGCTTTCTGATTATGTCCCTGGGAGCAAAATGTCAGCTCTTGAAAAGGCATTTGATCGGAATTTATTGGCTTCCTTTCTGCTGGCTAATGTGCTCACAGGGCTGGTAAACTTGTTTGTGGATACGCTGTTTGCCTCCTCAGTATCAGCTCTTCTAATCCTGATTTCCTATGCTTTGACTTTGTCTGTTGTCATGGGAATAGTAGATTTTTATGGTGTTAGGTTAAAATTTTGGTAG
Protein:  
MDSLPRSFNANKHLKEQFVSNLTGSSMLEISALLTTVPILVLLRPSICFQALTDGDTKETSLKKNDTAIVAFKNLKAYLATLVMDSVFIVLPTLLLFTVLAEWIYVWMILLSLLLIFVIAGKRSPHSPYLEGPKSFRMSISSYRVAMMDLGVGSFVLVNSIVSRQARNVSSSMDWWKAALKSTSPLLLLGFARLVSTMSVDYQVHVGEYGVHWNFFFTLAGVSILTSTVNVPSKYSGILGSVILVGYQSWLSSGLNVYLLSNKRGMDIISRNKEGIFSIFGYWGMYLIGVQVGYYLFFGNHSSVMLRSNNGTRIRVWLLSILFWILTVLLDRHVERISRRMCNLPYVTWVLAQNLQLLAILMLSDYVPGSKMSALEKAFDRNLLASFLLANVLTGLVNLFVDTLFASSVSALLILISYALTLSVVMGIVDFYGVRLKFW